Variable binned scatter plots

نویسندگان

  • Ming C. Hao
  • Umeshwar Dayal
  • Ratnesh K. Sharma
  • Daniel A. Keim
  • Halldór Janetzko
چکیده

The scatter plot is a well-known method of visualizing pairs of two continuous variables. Scatter plots are intuitive and easy-to-use, but often have a high degree of overlap which may occlude a significant portion of the data. To analyze a dense non-uniform dataset, a recursive drill-down is required for detailed analysis. In this paper, we propose variable binned scatter plots to allow the visualization of large amounts of data without overlapping. The basic idea is to use a non-uniform (variable) binning of the x and y dimensions and to plot all data points that are located within each bin into the corresponding squares. In the visualization, each data point is then represented by a small cell (pixel). Users are able to interact with individual data points for record level information. To analyze an interesting area of the scatter plot, the variable binned scatter plots with a refined scale for the subarea can be generated recursively as needed. Furthermore, we map a third attribute to color to obtain a visual clustering. We have applied variable binned scatter plots to solve real-world problems in the areas of credit card fraud and data center energy consumption to visualize their data distributions and causeeffect relationships among multiple attributes. A comparison of our methods with two recent scatter plot variants is included.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Visual analytics of large multidimensional data using variable binned scatter plots

The scatter plot is a well-known method of visualizing pairs of two-dimensional continuous variables. Multidimensional data can be depicted in a scatter plot matrix. They are intuitive and easy-to-use, but often have a high degree of overlap which may occlude a significant portion of data. In this paper, we propose variable binned scatter plots to allow the visualization of large amounts of dat...

متن کامل

Diagnostic checks for discrete-data regression models using posterior predictive simulations

Model checking with discrete data regressions can be dif®cult because the usual methods such as residual plots have complicated reference distributions that depend on the parameters in the model. Posterior predictive checks have been proposed as a Bayesian way to average the results of goodness-of-®t tests in the presence of uncertainty in estimation of the parameters. We try this approach usin...

متن کامل

Generalized scatter plots

' Corresponding author. Abstract Scatter Plots are one of the most powerful and most widely used techniques for visual data exploration. A well-known problem is that scatter plots often have a high degree of overlap, which may occlude a significant portion of the data values shown. In this paper, we propose th e generalized scatter plot technique, which allows an overlap-free representation of ...

متن کامل

Time-Segmented Scatter Plots: A View On Time-Dependent State Relations In Discrete-Event Time Series

Pairs of discrete event time series are characterised by their asynchronous nature, often hampering direct application of otherwise common analysis methods. For correct application of scatter plots, pairs of discrete event time series first have to be pre-processed and merged into a new synthetic time series of so-called coobservations. While standard scatter plots suggest analysis of global st...

متن کامل

Author's response to reviews Title:Admission Hypoxia-inducible Factor 1alpha Levels and In-hospital Mortality in Patients with Acute Decompensated Heart Failure Authors:

In truth, the authors have built the four scatter diagrams with regression line, but they have omitted to report the required regression equation for each of the drawn plots, i.e. within every plot the respective values of the intercept and beta-coefficient (slope) are lacking, in contrast with the good rule which provides for inclusion of the constant in equation, according to the scheme: Y( d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Information Visualization

دوره 9  شماره 

صفحات  -

تاریخ انتشار 2010